Picture for Tao Yu

Tao Yu

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Add code
May 28, 2026
Viaarxiv icon

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

Add code
May 26, 2026
Viaarxiv icon

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Add code
May 25, 2026
Viaarxiv icon

Any2Any: Efficient Cross-Embodiment Transfer for Humanoid Whole-Body Tracking

Add code
May 22, 2026
Viaarxiv icon

BioHuman: Learning Biomechanical Human Representations from Video

Add code
May 14, 2026
Viaarxiv icon

Uno-Orchestra: Parsimonious Agent Routing via Selective Delegation

Add code
May 06, 2026
Viaarxiv icon

Realizing Immersive Volumetric Video: A Multimodal Framework for 6-DoF VR Engagement

Add code
Apr 10, 2026
Viaarxiv icon

DirectFisheye-GS: Enabling Native Fisheye Input in Gaussian Splatting with Cross-View Joint Optimization

Add code
Apr 01, 2026
Viaarxiv icon

CUBE: A Standard for Unifying Agent Benchmarks

Add code
Mar 16, 2026
Viaarxiv icon

Monocular Mesh Recovery and Body Measurement of Female Saanen Goats

Add code
Feb 23, 2026
Viaarxiv icon